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Abstract. We completely classify the computational complexity of the list H-colouring 
problem for graphs (with possible loops) in combinatorial and algebraic terms: for every 
graph H the problem is either NP-complete, NL-complete, L-complete or is first-order de- 
finable; descriptive complexity equivalents are given as well via Datalog and its fragments. 
Our algebraic characterisations match important conjectures in the study of constraint 
satisfaction problems. 



1. Introduction 

Homomorphisms of graphs, i.e. edge-preserving mappings, generalise graph colourings, 
and can model a wide variety of combinatorial problems dealing with mappings and assign- 
ments [T7] . Because of the richness of the homomorphism framework, many computational 
aspects of graph homomorphisms have recently become the focus of much attention. In 
the list H- colouring problem (for a fixed graph H), one is given a graph G and a list L v 
of vertices of H for each vertex u in G, and the goal is to determine whether there is a 
homomorphism h from G to H such that h(v) € L v for all v. The complexity of such 
problems has been studied by combinatorial methods, e.g., in |X3|, I14j. In this paper, we 
study the complexity of the list homomorphism problem for graphs in the wider context of 
classifying the complexity of constraint satisfaction problems (CSP), see [31 1151 ITS]. It is 
well known that the CSP can be viewed as the problem of deciding whether there exists a 
homomorphism from a relational structure to another, thus naturally extending the graph 
homomorphism problem. 

Key words and phrases: graph homomorphism, constraint satisfaction problem, complexity, universal 
algebra, Datalog. 
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One line of CSP research studies the non-uniform CSP, in which the target (or template) 
structure T is fixed and the question is whether there exists a homomorphism from an 
input structure to T. Over the last years, much work has been done on classifying the 
complexity of this problem, denoted Hom(T) or CSP(T), with respect to the fixed target 
structure, see surveys [71 El [18]. Classification here is understood with respect to both 
computational complexity (i.e. membership in a given complexity class such as P, NL, or 
L, modulo standard assumptions) and descriptive complexity (i.e. definability of the class 
of all positive, or all negative, instances in a given logic). 

The best-known classification results in this direction concern the distinction between 
polynomial-time solvable and NP-complete CSPs. For example, a classical result of Hell and 
Nesetfil (see [171118] ) shows that, for a graph H, Hom(H) (aka H-colouring) is tractable if H 
is bipartite or admits a loop, and is NP-complete otherwise, while Schaefer's dichotomy |24| 
proves that any Boolean CSP is either in P or NP-complete. Recent work [1] established a 
more precise classification in the Boolean case: if T is a structure on {0, 1} then CSP(T) 
is either NP-complete, P-complete, NL-complete, ©L-complete, L-complete or in AC . 

Much of the work concerning the descriptive complexity of CSPs is centred around 
the database-inspired logic programming language Datalog and its fragments (see [61 El [12], 
120]). Feder and Vardi initially showed |15j that a number of important tractable cases 
of CSP(T) correspond to structures for which -iCSP(T) (the complement of CSP(T)) is 
definable in Datalog. Similar ties were uncovered more recently between the two fragments 
of Datalog known as linear and symmetric Datalog and structures T for which CSP(T) 
belongs to NL and L, respectively [9"1 [12]. 

Algebra, logic and combinatorics provide three angles of attack which have fueled 
progress in this classification effort [U [7j El [171 [TBI [20] . The algebraic approach (see [TJ E] ) 
links the complexity of CSP(T) to the set of functions that preserve the relations in T. 
In this framework, one associates to each T an algebra At and exploits the fact that the 
properties of At completely determine the complexity of CSP(T). This angle of attack was 
crucial in establishing key results in the field (see, for example, [21 El [7]). 

Tame Congruence Theory, a deep universal-algebraic framework first developed by 
Hobby and McKenzie in the mid 80's |19| . classifies the local behaviour of finite alge- 
bras into five types (unary, affine, Boolean, lattice and semilattice.) It was recently shown 
(see [6j [T], [22] ) that there is a strong connection between the computational and descriptive 
complexity of CSP(T) and the set of types that appear in At and its subalgebras. There are 
strong conditions involving types which are sufficient for NL-hardness, P-hardness and NP- 
hardness of CSP(T) as well as for inexpressibility of -iCSP(T) in Datalog, linear Datalog 
and symmetric Datalog. These sufficient conditions are also suspected (and in some cases 
proved) to be necessary, under natural complexity-theoretic assumptions. For example, (a) 
the presence of unary type is known to imply NP-completeness, while its absence is conjec- 
tured to imply tractability (see [7J); (b) the absence of unary and affine types was recently 
proved to be equivalent to definability in Datalog [2J; (c) the absence of unary, affine, and 
semilattice types is proved necessary, and suspected to be sufficient, for membership in NL 
and definability in linear Datalog [22]; (d) the absence of all types but Boolean is proved 
necessary, and suspected to be sufficient, for membership in L and definability in symmet- 
ric Datalog [22]. The strength of evidence varies from case to case and, in particular, the 
conjectured algebraic conditions concerning CSPs in NL and L (and, as mentioned above, 
linear and symmetric Datalog) still rest on relatively limited evidence [61 El HH HUJ [22] . 
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The aim of the present paper is to show that these algebraic conditions are indeed 
sufficient and necessary in the special case of list H-colouring for undirected graphs (with 
possible loops), and to characterise, in this special case, the dividing lines in graph-theoretic 
terms (both via forbidden subgraphs and through an inductive definition). One can view the 
list H-colouring problem as a CSP where the template is the structure H L consisting of the 
binary (edge) relation of H and all unary relations on H (i.e. every subset of H). Tractable 
list homomorphism problems for general structures were characterised in [5] in algebraic 
terms. The tractable cases for graphs were described in p3] in both combinatorial and 
(more specific) algebraic terms; the latter implies, when combined with a recent result |10j . 
that in these cases -i CSP(H^) definable in linear Datalog and therefore CSP(H L ) is in fact 
in NL. We complete the picture by refining this classification and showing that CSP(H L ) 
is either NP-complete, or NL-complete, or L-complete or in AC (and in fact first-order 
definable). We also remark that the problem of recognising into which case the problem 
CSP(H^) falls can be solved in polynomial time. 

As we mentioned above, the distinction between NP-complete cases and those in NL 
follows from earlier work [14J, and the situation is similar with distinction between L-hard 
cases and those leading to membership in AC |21l 122] . Therefore, the main body of 
technical work in the paper concerns the distinction between NL-hardness and membership 
in L. We give two equivalent characterisations of the class of graphs H such that CSP(H L ) 
is in L. One characterisation is via forbidden subgraphs (for example, the reflexive graphs 
in this class are exactly the (-P4, C4)-free graphs, while the irreflexive ones are exactly the 
bipartite (i-6> C6)-free graphs), while the other is via an inductive definition. The first 
characterisation is used to show that graphs outside of this class give rise to NL-hard 
problems; we do this by providing constructions witnessing the presence of a non-Boolean 
type in the algebras associated with the graphs. The second characterisation is used to 
prove positive results. We first provide operations in the associated algebra which satisfy 
certain identities; this allows us to show that the necessary condition on types is also 
sufficient in our case. We also use the inductive definition to demonstrate that the class 
of negative instances of the corresponding CSP is definable in symmetric Datalog, which 
implies membership of the CSP in L. 

2. Preliminaries 

2.1. Graphs and relational structures 

In the following we denote the underlying universe of a structure S, T, ... by its 
roman equivalent S, T, etc. A signature is a (finite) set of relation symbols with associated 
arities. Let T be a structure of signature r; for each relation symbol R G r we denote the 
corresponding relation of T by -R(T). Let S be a structure of the same signature. A homo- 
morphism from S to T is a map / from S to T such that f(R(S)) C R(T) for each R € r. 
In this case we write / : S — > T. A structure T is called a core if every homomorphism 
from T to itself is a permutation on T. We denote by CSP(T) the class of all r-structures 
S that admit a homomorphism to T, and by -1 CSP(T) the complement of this class. 

The direct n-th power of a r-structure T, denoted T n , is defined to have universe T n 
and, for any (say m-ary) R e r, (a.%, . . . , a m ) € R(T n ) if and only if (ai[i], ... ,a m [i]) € -R(T) 
for each 1 < i < n. For a subset I C.T, the substructure induced by I on T is the structure 
I with universe I and such that R(J) = -R(T) n I m for every m-ary R 6 r. 
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For the purposes of this paper, a graph is a relational structure H = (H; 8) where 9 
is a symmetric binary relation on H. The graph H is reflexive (irreflexive) if (x, x) E 8 
((x,x) 8) for all x € H. Given a graph H, let Si, ■■■ ,Sk denote all subsets of H; let 
H L be the relational structure obtained from H by adding all the Si as unary relations; 
more precisely, let r be the signature that consists of one binary relational symbol 8 and 
unary symbols i = l,...,k. The r-structure H L has universe H, 8(H L ) is the edge 
relation of H, and Ri(H L ) = S{ for all i = 1, . . . , k. It is easy to see that U L is a core. 
We call CSP(H^) the list homomorphism problem for H. Note that if G is an instance of 
this problem then 8(G) can be considered as a digraph, but the directions of the arcs are 
unimportant because H is undirected. Also, if an element v 6 G is in Ri(G) then this is 
equivalent to v having Si as its list, so G can be thought of as a digraph with H-lists. 

In p3], a dichotomy result was proved, identifying bi-arc graphs as those whose list 
homomorphism problem is tractable, and others as giving rise to NP-complete problems. 
Let C be a circle with two specified points p and q. A bi-arc is a pair of arcs (iV, S) such 
that N contains p but not q and S contains q but not p. A graph H is a bi-arc graph if 
there is a family of bi-arcs {(N X ,S X ) : x € H} such that, for every x,y £ H, the following 
hold: (i) if x and y are adjacent, then neither N x intersects S y nor N y intersects S x , and 
(ii) if x is not adjacent to y then both N x intersects S y and N y intersects S x . 

2.2. Algebra 

An n-ary operation on a set A is a map / : A n — >• A, a projection is an operation of 
the form e % n (xi, . . . ,x n ) = Xi for some 1 < i < n. Given an /i-ary relation 8 and an n-ary 
operation / on the same set A, we say that / preserves 8 or that 8 is invariant under / if 
the following holds: given any matrix M of size h x n whose columns are in 8, applying / 
to the rows of M will produce an /i-tuple in 8. 

A polymorphism of a structure T is an operation / that preserves each relation in T; 
in this case we also say that T admits f. In other words, an n-ary polymorphism of T is 
simply a homomorphism from T™ to T. With any structure T, one associates an algebra At 
whose universe is T and whose operations are all polymorphisms of T. Given a graph H, we 
let EI denote the algebra associated with H L . An operation on a set is called conservative 
if it preserves all subsets of the set (as unary relations). So, the operations of EI are the 
conservative polymorphisms of H. Polymorphisms can provide a convenient language when 
defining classes of graphs. For example, it was shown in [4] that a graph is a bi-arc graph 
if and only if it admits a conservative majority operation where a majority operation is a 
ternary operation m satisfying the identities m(x, x, y) = m(x, y, x) = m(y, x, x) = x. 

In order to state some of our results, we will need the notions of a variety and a term 
operation. Let 7 be a signature, i.e. a set of operation symbols / each of a fixed arity (we 
use the term "signature" for both structures and algebras, this will cause no confusion). An 
algebra of signature I is a pair A = (A; F) where A is a non-empty set (the universe of A) 
and F = {/ A : / € J} is the set of basic operations (for each / € I, / A is an operation on 
A of the corresponding arity). The term operations of A are the operations built from the 
operations in F and projections by using composition. An algebra all of whose (basic or 
term) operations are conservative is called a conservative algebra. A class of similar algebras 
(i.e. algebras with the same signature) which is closed under formation of homomorphic 
images, subalgebras and direct products is called a variety. The variety generated by an 
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algebra A is denoted by V(A), and is the smallest variety containing A, i.e. the class of all 
homomorphic images of subalgebras of powers of A. 

Tame Congruence Theory, as developed in [19], is a powerful tool for the analysis of 
finite algebras. Every finite algebra has a typeset, which describes (in a certain specified 
sense) the local behaviour of the algebra. It contains one or more of the following 5 types: 
(1) the unary type, (2) the affine type, (3) the Boolean type, (4) the lattice type and (5) 
the semilattice type. The numbering of the types is fixed, and they are often referred to 
by their numbers. Simple algebras, i.e. algebras without non-trivial proper homomorphic 
images, admit a unique type; the prototypical examples are: any 2-element algebra whose 
basic operations are all unary has type 1. A finite vector space has type 2. The 2-element 
Boolean algebra has type 3. The 2-element lattice is the 2-element algebra with two binary 
operations ({0, 1}; V, A): it has type 4. The 2-element semilattices are the 2-element algebras 
with a single binary operation ({0,1}; A) and ({0, 1}; V): they have type 5. The typeset 
of a variety V, denoted typ(V), is simply the union of typesets of the algebras in it. We 
will be mostly interested in type-omitting conditions for varieties of the form V(At), and 
Corollary 3.2 of [25] says that in this case it is enough to consider the typesets of Ax and 
its subalgebras. 

On the intuitive level, if T is a core structure then the typeset iyp(V(Ax)) contains 
crucial information about the kind of relations that T can or cannot simulate, thus implying 
lower/upper bounds on the complexity of CSP(T). For our purposes here, it will not be 
necessary to delve further into the technical aspects of types and typesets. We only note 
that there is a very tight connection between the kind of equations that are satisfied by 
the algebras in a variety and the types that are admitted or omitted by a variety, i.e. those 
types that do or do not appear in the typesets of algebras in the variety [19]. 

In this paper, we use ternary operations fx, ■ ■ ■ ,f n satisfying the following identities: 

x = fi(x,y,y) (2.1) 
fi(x,x,y) = fi+i(x,y,y) for alH = l,...n- 1 (2.2) 
f n (x,x,y) = y. (2.3) 
The following lemma contains some type-omitting results that we use in this paper. 

Lemma 2.1. [19] A finite algebra A has term operations fi, ■ ■ ■ , f n , for some n > 1, 
satisfying identities \2.1\) - [2~3\) if and only if the variety V(A) omits types 1, 4 and 5. 
If a finite algebra A has a majority term operation then V(A) omits types 1, 2 and 5. 

We remark in passing that operations satisfying identities (|2.ip — f)2.3p are also known 
to characterise a certain algebraic (congruence) condition called (n + l)-permutability [19] . 

2.3. Datalog 

Datalog is a query and rule language for deductive databases (see [20 1). A Datalog 
program T> over a (relational) signature r is a finite set of rules of the form h ^— b\f\ . . . Ab m 
where h and each bi are atomic formulas Rj{v\, We say that h is the head of the rule 

and that b\ A . . . A b m is its body. Relational predicates Rj which appear in the head of some 
rule of T> are called intensional database predicates (IDBs) and are not part of the signature 
r. All other relational predicates are called extensional database predicates (EDBs) and are 
in r. So, a Datalog program is a recursive specification of IDBs (from EDBs). 
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A rule of T> is linear if its body contains at most one IDB and is non-recursive if its 
body contains only EDBs. A linear but recursive rule is of the form I\{x) ^— ^(y) A 
E\{z\) A ... A Ek{zk) where I\,l2 are IDBs and the Ei are EDBs (note that the variables 
occurring in x,y,Zi are not necessarily distinct). Each such rule has a symmetric /2G/) ^~ 
I\{x) A E\(zi) A . . . A Ek(zk). A Datalog program is non-recursive if all its rules are non- 
recursive, linear if all its rules are linear and symmetric if it is linear and if the symmetric 
of each recursive rule of T> is also a rule of T>. 

A Datalog program T> takes a r-structure A as input and returns a structure T>(A.) 
over the signature r' = r U {I : I is an IDB in D}. The relations corresponding to r are 
the same as in A, while the new relations are recursively computed by T> , with semantics 
naturally obtained via least fixed-point of monotone operators. We also want to view a 
Datalog program as being able to accept or reject an input r-structure and this is achieved 
by choosing one of the IDBs of V as the goal predicate: the r-structure A is accepted by 
T> if the goal predicate is non-empty in D(A). Thus every Datalog program with a goal 
predicate defines a class of structures - those that are accepted by the program. 

When using Datalog to study CSP(T), one usually speaks of the definability of -i CSP(T) 
in Datalog (i.e. by a Datalog program) or its fragments (because any class definable in Dat- 
alog must be closed under extension). Examples of CSPs definable in Datalog and its 
fragments can be found, e.g., in [HI [12]. As we mentioned before, any problem CSP(T) is 
tractable if its complement is definable in Datalog, and all such structures were recently iden- 
tified in |2j. Definability of -iCSP(T) in linear (symmetric) Datalog implies that CSP(T) 
belongs to NL and L, respectively [T2] . As we discussed in Section [IJ there is a connection 
between definability of CSPs in Datalog (and its fragments) and the presence/absence of 
types in the corresponding algebra (or variety). 

Note that it follows from Lemma 12.11 and from the results in [22[ [26] that if, for a core 
structure T, -iCSP(T) is definable in symmetric Datalog then T must admit, for some n, 
operations satisfying identities fj2 . 1 1) — fj2 . 3^ . Moreover, with the result of [2j, a conjecture 
from [22J can be restated as follows: for a core structure T, if -iCSP(T) is definable in 
Datalog and, for some n, T admits operations satisfying (12. 1|) — f)2.3[) . then -iCSP(T) is 
definable in symmetric Datalog. This conjecture is proved in [TTj for n = 1. 

3. A class of graphs 

In this section, we give combinatorial characterisations of a class of graphs whose list 
homomorphism problem will turn out to belong to L. 

Let Hi and H2 be bipartite irreflexive graphs, with colour classes B\, T\ and B2 and T2 
respectively, with T\ and B2 non-empty. We define the special sum Hi ©H2 (which depends 
on the choice of the Bi and Tj) as follows: it is the graph obtained from the disjoint union 
of Hi and H2 by adding all possible edges between the vertices in T\ and -E>2- Notice that 
we can often decompose a bipartite graph in several ways, and even choose B\ or T2 to be 
empty. We say that an irreflexive graph H is a special sum or expressed as a special sum 
if there exist two bipartite graphs and a choice of colour classes on each such that H is 
isomorphic to the special sum of these two graphs. 

Definition 3.1. Let K, denote the smallest class of irreflexive graphs containing the one- 
element graph and closed under (i) special sum and (ii) disjoint union. We call the graphs 
in IC basic irreflexive. 
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Figure 1: The forbidden mixed graphs. 



The following result gives a characterisation of basic irreflexive graphs in terms of 
forbidden subgraphs: 

Lemma 3.2. Let H be an irreflexive graph. Then the following conditions are equivalent: 

(1) H is basic irreflexive; 

(2) H is bipartite, contains no induced 6- cycle, nor any induced path of length 5. 

We shall now describe our main family of graphs, first by forbidden induced subgraphs, 
and then in an inductive manner. 

Definition 3.3. Define the class C of graphs as follows: a graph H belongs to C if it 
contains none of the following as an induced subgraph: 

(1) the reflexive path of length 3 and the reflexive 4-cycle; 

(2) the irreflexive cycles of length 3, 5 and 6, and the irreflexive path of length 5; 

(3) Bl, B2, B3, B4, B5 and B6 (see Figured)) 

We will now characterise the class C in an inductive manner. 

Definition 3.4. A connected graph H is basic if either (i) H is a single loop, or (ii) H is a 
basic irreflexive graph, or (iii) H is obtained from a basic irreflexive graph Hi with colour 
classes B and T by adding every edge (including loops) of the form {t, t'} where t, t' € T. 

Definition 3.5. Given two vertex-disjoint graphs Hi and H2, the adjunction of Hi to H2 
is the graph Hi H2 obtained by taking the disjoint union of the two graphs, and adding 
every edge of the form {x, y} where a; is a loop in Hi and y is a vertex of H2. 

Lemma 3.6. Let £.r denote the class of reflexive graphs in C Then £r is the smallest 
class V of reflexive graphs such that: 

(1) T> contains the one- element graph; 

(2) T> is closed under disjoint union; 

(3) if Hi is a single loop and H2 € T> then Hi H2 € T>. 

Lemma 13.61 states that the reflexive graphs avoiding the path of length 3 and the 4- 
cycle are precisely those constructed from the one-element loop using disjoint union and 
adjunction of a universal vertex. These graphs can also be described by the following 
property: every connected induced subgraph of size at most 4 has a universal vertex. These 
graphs have been studied previously as those with NLCT width 1, which were proved to 
be exactly the trivially perfect graphs [16]. Our result provides an alternative proof of the 
equivalence of these conditions. 



Theorem 3.7. The class C is the smallest class C of graphs such that: 
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(1) C contains the basic graphs; 

(2) C is closed under disjoint union; 

(3) i/Hi is a basic graph and H2 G C then Hi H2 G C. 

Proof. We start by showing that every basic graph is in C, i.e. that a basic graph does not 
contain any of the forbidden graphs. If H is a single loop or a basic irreflexive graph, then 
this is immediate. Otherwise H is obtained from a basic irreflexive graph Hi with colour 
classes B and T by adding every edge of the form {t\,t2) where t{ 6 T. In particular, the 
loops form a clique and no edge connects two non-loops; it is clear in that case that H 
contains none of Bl, B2, B3, B4. On the other hand if H contains B5 or B6, then Hi 
contains the path of length 5 or the 6-cycle, contradicting the fact that Hi is basic. 

Next we show that C is closed under disjoint union and adjunction of basic graphs. 
It is obvious that the disjoint union of graphs that avoid the forbidden graphs will also 
avoid these. So suppose that an adjunction Hi H2, where Hi is a basic graph, contains 
an induced forbidden graph B whose vertices are neither all in H\ nor H2] without loss 
of generality H\ contains at least one loop, its loops form a clique and none of its edges 
connects two non-loops. It is then easy to verify that B contains both loops and non-loops. 
Because the other cases are similar, we prove only that B is not B3: since vertex d is not 
adjacent to a it must be in H2, and similarly for c. Since b is not adjacent to d it must also 
be in H2; since non-loops of Hi are not adjacent to elements of H2 it follows that a is in 
H2 also, a contradiction. 

Now we must show that every graph in C can be obtained from the basic graphs by 
disjoint union and adjunction of basic graphs. Suppose this is not the case. If H is a 
counterexample of minimum size, then obviously it is connected, and it contains at least 
one loop for otherwise it is a basic irreflexive graph. By Lemma 13.61 H also contains at 
least one non-loop. 

For a £ H let N(a) denote its set of neighbours. Let R(H) denote the subgraph of H 
induced by its set R(H) of loops, and let J(H) denote the subgraph induced by J(H), the 
set of non-loops of H. Since H is connected and neither Bl nor B2 is an induced subgraph 
of H, the graph R(H) is also connected, and furthermore every vertex in J(H) is adjacent 
to some vertex in R(H). By Lemma l3.6l we know that R(H) contains at least one universal 
vertex: let U denote the (non-empty) set of universal vertices of R(H). Let J denote the 
set of all a G J{H) such that N(a) n R(H) C U. Let us show that J ^ 0. For every u £ U, 
there is w 6 J{H) not adjacent to u because otherwise H is obtained by adjoining u to the 
rest of H, a contradiction with the choice of H. If this w has a neighbour r G R(H)\U then 
there is some s G R{H) \ U not adjacent to r, and the graph induced by {w, u, s, r} contains 
B2 or B3, a contradiction. Hence, w G J. Let S denote the subgraph of H induced by 
U U J. The graph S is connected. We claim that the following properties also hold: 

(1) if a and b are adjacent non-loops, then N(a) fl U = N(b) H U ; 

(2) if a is in a connected component of the subgraph of S induced by J with more than 
one vertex, then for any other b G J, one of N(a) nU,N(b) C\U contains the other. 

The first statement holds because Bl is forbidden, and the second follows from the first 
because B4 is also forbidden. Let J%, . . . , denote the different connected components of 
J in S. By (1) we may let N(Ji) denote the set of common neighbours of members of J{ in 
U. By (2), we can re-order the Jj's so that for some 1 < m < k we have iV(Jj) C N(Jj) for 
all % < m and all j > m, and, in addition, we have m = 1 or |Ji| = 1 for all 1 < % < m. Let 
B denote the subgraph of S induced by B = \J™ =1 {J% U N(J{)), and let C be the subgraph 
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of H induced by H \ B. We claim that H = B C. For this, it suffices to show that every 
element in U£i N(Ji) is adjacent to every non-loop c G C. By construction this holds if 
c £ J (1 C . Now suppose this does not hold: then some x £ J(H) \ J is not adjacent to 
some y £ N(Ji) for some i < m. Since x J we may find some 2 € R{H) \ U adjacent to 
x; it is of course also adjacent to y. Since z £ U there exists some z' £ R{H) \ U that is not 
adjacent to z, but it is of course adjacent to y. If x is adjacent to z', then {x, z, z'} induces 
a subgraph isomorphic to B2, a contradiction. Otherwise, {x, z,y, z'} induces a subgraph 
isomorphic to B3, also a contradiction. 

If every Jj with i <m contains a single element, notice that B is a basic graph: indeed, 
removing all edges between its loops yields a bipartite irreflexive graph which contains 
neither the path of length 5 nor the 6-cycle, since B contains neither B5 nor B6. Since 
this contradicts our hypothesis on H, we conclude that m = 1. But this means that N(J\) 
is a set of universal vertices in H. Let u be such a vertex and let D denote its complement 
in H: clearly H is obtained as the adjunction of the single loop u to D, contradicting our 
hypothesis. This concludes the proof. ■ 

4. Classification results 

Recall the standard numbering of types: (1) unary, (2) affine , (3) Boolean, (4) lattice 
and (5) semilattice. We will need the following auxiliary result (which is well known). Note 
that the assumptions of this lemma effectively say that CSP(T) can simulate the graph 
A;-colouring problem (with k = \U\) or the directed si-connectivity problem. 

Lemma 4.1. Let S, T be structures, let s±,S2 £ S, and let R = {(/(si), f{s2)) \ f ■ S — > T}. 

(1) If R = {{x,y) £ U 2 | x / y} for some subset U C T with \U\ > 3 then V(A T ) 
admits type 1. 

(2) If R = {(t,t), (t,t'), (t' ,t')} for some distinct t,t' £ T then V(At) admits at least 
one of the types 1, 4, 5. 

Proof [sketch] : The assumption of this lemma implies that At has a subalgebra (induced by 
U and {t,t'}, respectively) such that all operations of the subalgebra preserve the relation 
R. It is well-known (see, e.g., [17]) that all operations preserving the disequality relation 
on U are essentially unary, while it is easy to check that the order relation on a 2-element 
set cannot admit operations satisfying identities (|2.ip - (|2.3p . so one can use Lemma l2.1i ■ 

The following lemma connects the characterisation of bi-arc graphs given in [3] with a 
type-omitting condition. 

Lemma 4.2. Let H be a graph. Then the following conditions are equivalent: 

(1) the variety V(H) omits type 1; 

(2) the graph H admits a conservative majority operation; 

(3) the graph H is a bi-arc graph. 

The results summarised in the following theorem are known (or easily follow from known 
results, with a little help from Lemma l4.2p . 

Theorem 4.3. Let HI be a graph. 

• If typ(V(M.)) admits type 1, then -iCSP(H L ) is not expressible in Datalog and 
CSP(H L ) is NP- complete (under first- order reductions); 
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• if typ(y(M)) omits type 1 but admits type 4 then -iCSP(H L ) is not expressible in 
symmetric Datalog but is expressible in linear Datalog, and CSP(H L ) is Nh-complete 
(under first- order reductions.) 

Proof. The first statement is shown in [22]. If the variety omits type 1, then admits 
a majority operation by Lemma 14.21 and then -iCSP(H L ) is expressible in linear Datalog 
by |10j : in particular the problem is in NL. If, furthermore, the variety admits type 4, then 
-iCSP(H L ) is not expressible in symmetric Datalog and is NL-hard by results in [22]. ■ 

By Lemma \2.1\ the presence of a majority operation in EI implies that typ(V(M)) can 
contain only types 3 and 4. Type 4 is dealt with in Theorem l4.3| so it remains to investigate 
graphs H with typ(V(M)) = {3}. 

The next theorem is the main result of this paper. 

Theorem 4.4. Let H be a graph. Then the following conditions are equivalent: 

(1) H admits conservative operations satisfying \2. l\) - [273\) for n = 3; 

(2) H admits conservative operations satisfying jj?.i)) - pO|) for some n > 1; 

(3) typ(V(EQ) = {3}; 

(4) H G C; 

(5) -iCSP(H i ) is definable in symmetric Datalog. 

If the above holds then CSP(H i ) is in the complexity class L. 

Proof [sketch]: (1) =4> (2) is trivial. If (2) holds then by Lemma 12 . 1 1 V (H) omits types 1, 4, 
and 5. By Lemma f4.2t H admits a majority operation, so Lemma [2 . 1 1 implies that V(H) also 
omits type 2; hence (3) holds. Implication (3)=^(4) is the content of Lemma 14.51 below, and 
(5) implies (3) by a result of [22]. By using Theorem 13. 7\ one can show that (4) implies both 
(1) and (5). Finally, definability in symmetric Datalog implies membership in L by [12]. ■ 

Lemma 4.5. J/H £ £ then typ(V(M)) ^ {3}. 

Proof. By Theorem 9.15 of [19J, typ(V(M)) = {3} if and only if H admits a sequence 
of conservative operations satisfying certain identities (in the spirit of f)2. 1 j) — (|2.3[) ) . By 
conservativity, such operations can be restricted to any subset of H while satisfying the 
same identities, so the property typ(V(M)) = {3} is inherited by induced subgraphs. It 
follows that it is enough to prove this lemma for the forbidden graphs from Definition 13.31 

For the irreflexive odd cycles, the lemma follows immediately from the main results 
of [3l [23]. The proof of Theorem 3.1 of [13] shows that the conditions of Lemma I4.1T 1) are 
satisfied by (some S, si,S2 and) T = F L where F is the irreflexive 6-cycle. One can check 
that the reflexive 4-cycle is not a bi-arc graph, so we can apply Lemma 14.21 in this case. 

For the remaining forbidden graphs F from Definition 13.31 we use Lemma 14.1( 2) with 
T = F L . In each case, the binary relation of the structure S will be a short undirected 
path, and s\, S2 will be the endpoints of the path. We will represent such a structure S by a 
sequence of subsets of F (indicating lists assigned to vertices of the path) . It can be easily 
checked that, in each case, the relation R defined as in Lemma 14.11 is of the required form. 

If F is the reflexive path of length 3, say a — b — c — d, then S = ac — be — ad — ac. If F 
is the irreflexive path of length 5, say a — b — c — d — e — f then S = ae — bd — ce — bf — ae. 
For graphs Bl — B6, we use notation from Fig. [TJ For Bl, S = be — be — ab — ab — be. For 
B2, S = be — ac — ab — be. For B3, S = be — ad — bd — be. For B4, S = ae — bd — cd — ae. 
Finally, for both B5 and B6, S = ac — b'c' — ab — a'c' — ac. m 
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For completeness' sake, we describe graphs whose list homomorphism problem is de- 
finable in first-order logic (equivalently, is in AC , see [6].) By results in [22J, any problem 
CSP(T) is either first-order definable or L-hard under FO reductions. Hence, it follows 
from Theorem 14.41 that, for a graph H € C, the list homomorphism problem for H is either 
first-order definable or L-complete. 

We need the following characterisation of structures whose CSP is first-order definable 
[21j . Let T be a relational structure and let a,b G T. We say that b dominates a in T if for 
any relation R of T, and any tuple t £ R, replacement of any occurrence of a by b in t will 
yield a tuple of R. Recall the definition of a direct power of a structure from Subsection l2.1i 
If T is a relational structure, we say that the structure T 2 dismantles to the diagonal if 
there exists a sequence of elements {ao, • • • , a n } = T 2 \ {(a, a) : a £ T} such that, for all 
< i < n, di is dominated in T», where To = T 2 and Tj is the substructure of T 2 induced 
by T 2 \ {a , . . . , a^i} for i > 0. 

Lemma 4.6 ([UJ). Let T be a core relational structure. Then CSP(T) is first- order defin- 
able if and only if T 2 dismantles to the diagonal. 

Theorem 4.7. Let H be a graph. Then CSP(H L ) is first-order definable if and only if 
H has the following form: H is the disjoint union of two sets L and N such that (i) L 
is the set of loops of H and induces a complete graph, (ii) N is the set of non-loops of H 
and induces a graph with no edges, and (Hi) N = {x\, . . . ,x m } can be ordered so that the 
neighbourhood of Xi is contained in the neighbourhood o/xj+i for all 1 < i < m — 1. 

Proof. We first prove that conditions (i) and (ii) are necessary. Notice that if CSP(H L ) is 
first-order definable then so is CSP(K^) for any induced substructure K of H. Let x and 
y be distinct vertices of H and let K L be the substructure of H L induced by {x,y}. If x 
and y are non-adjacent loops, then 0(K) = {(x,x), (y,y)} the equality relation on {x,y}; if 
x and y are adjacent non-loops, then 0(K) = {(x,y), (y,x)}, the adjacency relation of the 
complete graph on 2 vertices. It is well known (and can be easily derived from Lemma |4.6|) 
that neither of these classes CSP(K L ) is first-order definable. It follows that the loops of 
H induce a complete graph and the non-loops induce a graph with no edges. 

Now we prove (iii) is necessary. Suppose for a contradiction that there exist distinct 
elements x and y oi N and elements n and m of L such that m is adjacent to x but not 
to y, and n is adjacent to y but not to x. Then CSP(G) is first-order definable, where G 
is the substructure of H L induced by {x,y,m, n}. By Lemma 14.61 G 2 dismantles to the 
diagonal. Then (x,y) must be dominated by one of (x,x), (y,x) or (y,y), since domination 
respects the unary relation {x,y} 2 (on G 2 ). But (m,n) is a neighbour of (x,y) and none of 
the other three, a contradiction. 

For the converse: we show that we can dismantle (H L ) 2 to the diagonal. Let x € H: 
then (x\,x) and (x, x\) are dominated by (x,x). Suppose that we have dismantled every 
element containing a coordinate equal to Xi with i < j — 1: if x is any element of H such 
that the elements (xj,x) and (x,Xj) remain, then either x is a loop or x = x^ with k > j; 
in any case the elements (xj,Xf.) and (xj.,Xj) are dominated by (x,x). In this way we can 
remove all pairs (x, y) with one of x or y a non-loop. For the remaining pairs, notice that 
if u and v are any loops then (u,v) is dominated (in what remains of (H L ) 2 ) by (u,u). m 

Finally, given a graph H, it can be decided in polynomial time which of the different 
cases delineated in Theorems 14.31 14.41 14.71 the list homomorphism problem for H satisfies. 
Indeed, it is known that bi-arc graphs can be recognised in polynomial time (see [14]) . 
Assume that H is a bi-arc graph: the forbidden substructure definition of the class C gives 
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an AC algorithm to recognise them; and those graphs whose list homomorphism problem 
is first-order definable can be recognised in polynomial time by results of |21| . 
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